(54) PROTEIN AND PREPARATION THEREOF 

(57) Abstract: 

NEW MATERIAL: A fused protein of a heat-labile enterotoxin B subunit (hereinafter referred to 
as LTB) and a protein other than the heat-labile enterotoxin (hereinafter referred to as LT). 

USE: Useful as an immunogen for therapeutic or prophylactic vaccines, or for the treatment of 
various diseases. The fused protein is efficiently absorbed through the nasal mucosa. 

PREPARATION: A DNA encoding LTB (preferably LTB produced by enterotoxigenic Escherichia 
coli isolated from humans) is first joined to a DNA encoding a protein other than LT (e.g. an 
HSV surface protein, which is an HSV antigen) to prepare a fused gene. The fused gene is linked 
downstream of a promoter in an expression vector to prepare an expression vector carrying a 
DNA encoding the fused protein. A host cell is transformed with the expression vector, and the 
transformant is cultured to produce the fused protein. 
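
The gene-fusion step described under PREPARATION requires that the two coding sequences be joined in the same reading frame. The following is a minimal sketch of that check, not the method of this document: the function names and the toy sequences are hypothetical, and the six-base linker GCTAGC (the NheI recognition sequence) is used only as an illustration.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def split_codons(seq):
    """Split a DNA string into complete codons, ignoring whitespace."""
    seq = "".join(seq.split()).upper()
    return [seq[i:i + 3] for i in range(0, len(seq) - len(seq) % 3, 3)]


def fuse_in_frame(upstream_orf, linker, downstream_orf):
    """Join two coding sequences through a linker without shifting the frame.

    The stop codon of the upstream ORF (if present) is dropped so that
    translation reads through into the downstream ORF.
    """
    up = "".join(upstream_orf.split()).upper()
    link = "".join(linker.split()).upper()
    down = "".join(downstream_orf.split()).upper()
    if not up or len(up) % 3 or len(link) % 3:
        raise ValueError("upstream ORF and linker must be whole codons")
    if up[-3:] in STOP_CODONS:
        up = up[:-3]
    fused = up + link + down
    # Any stop codon before the final codon would truncate the fusion.
    if any(c in STOP_CODONS for c in split_codons(fused)[:-1]):
        raise ValueError("premature stop codon in fused gene")
    return fused


# Hypothetical toy sequences, chosen only to exercise the function.
fused = fuse_in_frame("ATG GCC TGA", "GCTAGC", "GCC CCC TAG")
print(split_codons(fused))
```

A linker whose length is not a multiple of three is rejected, since it would shift the downstream reading frame.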
Leu Gly Ala Val Ile Leu Phe Val Val 
Arg Gly Lys Tyr Ala Leu Ala Asp Ala 
Asn Arg Phe Arg Gly Lys Asp Leu Pro 
Pro Pro Gly Val Arg Arg Val Tyr His 
Pro Phe Gln Pro Pro Ser Leu Pro Ile 
Glu Arg Ala Cys Arg Ser Val Leu Leu 
Gln Ile Val Arg Gly Ala Ser Glu Asp 
Leu Thr Ile Ala Trp Phe Arg Met Gly 
Thr Val Met Glu Tyr Thr Glu Cys Ser 
Cys Pro Ile Arg Thr Gln Pro Arg Trp 
Ala Val Ser Glu Asp Asn Leu Gly Phe 
Glu Thr Ala Gly Thr Tyr Leu Arg Leu 
Glu Ile Thr Gln Phe Ile Leu Glu His 
Tyr Ala Leu Pro Leu Arg Ile Pro Pro 
Ala Tyr Gln Gln Gly Val Thr Val Asp 
Phe Ile Pro Glu Asn Gln Arg Thr Val 
Ala Gly Trp His Gly Pro Lys Ala Pro 
Pro Glu Leu Ser Glu Thr Pro Asn Ala 
Glu Asp Pro Glu Asp Ser Ala Leu Leu 
Ala Pro Gln Ile Pro Pro Asn Trp His 
Ala Thr Pro Tyr His Pro Pro Ala Thr 
Ala Gly Ala Val Gly Gly Ser Leu Leu 
Ile Val Tyr Trp Met His Arg Arg Thr 
Arg Leu Pro His Ile Arg Glu Asp Asp 
Leu Phe Tyr 
1 TTT AAA AAG CAG GGG TTA GGG AGT TGT TCG GTC ATA AGC TTC AGC 
46 GCG AAC GAC CAA CTA CCC CGA TCA TCA GTT ATC CTT AAG GTC TCT 
91 TTT GTG TGG TGC GTT CCG GTA TGG GGG GGG CTG CCG CCA GGT TGG 
136 GGG CCG TGA TTT TGT TTG TCG TCA TAG TGG GCC TCC ATG GGG TCC 
181 GCG GCA AAT ATG CCT TGG CGG ATG CCT CTC TCA AGA TGG CCG ACC 
226 CCA ATC GCT TTC GCG GCA AAG ACC TTC CGG TCC TGG ACC CGC TGA 
271 CCG ACC CTC CGG GGG TCC GGC GCG TGT ACC ACA TCC AGG CGG GCC 
316 TAC CGG ACC CGT TCC AGC CCC CCA GCC TCC CGA TCA CGG TTT ACT 
361 ACG CCG TGT TGG AGC GCG CCT GCC GCA GCG TGC TCC TAA ACG CAC 
406 CGT CGG AGG CCC CCC AGA TTG TCC GCG GGG CCT CCG AAG ACG TCC 
451 GGA AAC AAC CCT ACA ACC TGA CCA TCG CTT GGT TTC GGA TGG GAG 
496 GCA ACT GTG CTA TCC CCA TCA CGG TCA TGG AGT ACA CCG AAT GCT 
541 CCT ACA ACA AGT CTC TGG GGG CCT GTC CCA TCC GAA CGC AGC CCC 
586 GCT GGA ACT ACT ATG ACA GCT TCA GCG CCG TCA GCG AGG ATA ACC 
631 TGG GGT TCC TGA TGC ACG CCC CCG CGT TTG AGA CCG CCG GCA CGT 
676 ACC TGC GGC TCG TGA AGA TAA ACG ACT GGA CGG AGA TTA CAC AGT 
721 TTA TCC TGG AGC ACC GAG CCA AGG GCT CCT GTA AGT ACG CCC TCC 
766 CGC TGC GCA TCC CCC CGT CAG CCT GCC TCT CCC CCC AGG CCT ACC 
811 AGC AGG GGG TGA CGG TGG ACA GCA TCG GGA TGC TGC CCC GCT TCA 
856 TCC CCG AGA ACC AGC GCA CCG TCG CCG TAT ACA GCT TGA AGA TCG 
901 CCG GGT GGC ACG GGC CCA AGG CCC CAT AGA CGA GCA CCC TGC TGC 
946 CCC CTG AGC TGT CCG AGA CCC CCA ACG CCA CGC AGC CAG AAC TCG 
991 CCC CGG AAG ACC CCG AGG ATT CGG CCC TCT TGG AGG ACC CCG TGG 
1036 GGA CGG TGG CGC CGC AAA TCC CAC CAA ACT GGC ACA TCC CGT CGA 
1081 TCC AGG ACG CCG CGA CGC CTT ACC ATC CCC CGG CCA CCC CGA ACA 
1126 ACA TGG GCC TGA TCG CCG GCG CGG TGG GCG GCA GTC TCC TGG CAG 
1171 CCC TGG TCA TTT GCG GAA TTG TGT ACT GGA TGC ACC GCC GCA CTC 
1216 GGA AAG CCC CAA AGC GCA TAC GCC TCC CCC ACA TCC GGG AAG ACG 
1261 ACC AGC CGT CCT CGC ACC AGC CCT TGT TTT ACT AGA TAC CCC CCC 
1306 TTA ATG GGT GCG GGG GGG TCA GGT CTG CGG GGT TGG GAT GGG ACC 
1351 TTA ACT CCA TAC AAA GCG AGT CTG GAA GGG GGG AAA GGC GGA CAG 
1396 TCG ATA AGT CGG TAG CGG GGG ACG CGC ACC TGT TCC GCC TGT CGC 
1441 ACC CAC AGC TTT TTC GCG A 
Ala Pro 
Ser Ser Pro Gly Thr Pro Gly Val Ala Ala Ala Thr Gln Ala Ala Asn 
Gly Gly Pro Ala Thr Pro Ala Pro Pro Ala Pro Gly Pro Ala Pro Thr 
Gly Asp Thr Lys Pro Lys Lys Asn Lys Lys Pro Lys Asn Pro Pro Pro 
Pro Arg Pro Ala Gly Asp Asn Ala Thr Val Ala Ala Gly His Ala Thr 
Leu Arg Glu His Leu Arg Asp Ile Lys Ala Lys Asn Thr Asp Ala Asn 
Phe Tyr Val Cys Pro Pro Pro Thr Gly Ala Thr Val Val Gln Phe Glu 
Gln Pro Arg Arg Cys Pro Thr Arg Pro Glu Gly Gln Asn Tyr Thr Glu 
Gly Ile Ala Val Val Phe Lys Glu Asn Ile Ala Pro Tyr Lys Phe Lys 
Ala Thr Met Tyr Tyr Lys Asp Val Thr Val Ser Gln Val Trp Phe Gly 
His Arg Tyr Ser Gln Phe Met Gly Ile Phe Glu Asp Arg Ala Pro Val 
Pro Phe Glu Glu Val Ile Asp Lys Ile Asn Ala Lys Gly Val Cys Arg 
Ser Thr Ala Lys Tyr Val Arg Asn Asn Leu Glu Thr Thr Ala Phe His 
Arg Asp Asp His Glu Thr Asp Met Glu Leu Lys Pro Ala Asn Ala Ala 
Thr Arg Thr Ser Arg Gly Trp His Thr Thr Asp Leu Lys Tyr Asn Pro 
Ser Arg Val Glu Ala Phe His Arg Tyr Gly Thr Thr Val Asn Cys Ile 
Val Glu Glu Val Asp Ala Arg Ser Val Tyr Pro Tyr Asn Glu Phe Val 
Leu Ala Thr Gly Asp Phe Val Tyr Met Ser Pro Phe Tyr Gly Tyr Arg 
Glu Gly Ser His Thr Glu His Thr Ser Tyr Ala Ala Asp Arg Phe Lys 
Gln Val Asp Gly Phe Tyr Ala Arg Asp Leu Thr Thr Lys Ala Arg Ala 
Thr Ala Pro Thr Thr Arg Asn Leu Leu Thr Thr Pro Lys Phe Thr Val 
Ala Trp Asp Trp Val Pro Lys Arg Pro Ser Val Cys Thr Met Thr Lys 
Trp Gln Glu Val Asp Glu Met Leu Arg Ser Glu Tyr Gly Gly Ser Phe 
Arg Phe Ser Ser Asp Ala Ile Ser Thr Thr Phe Thr Thr Asn Leu Thr 
Glu Tyr Pro Leu Ser Arg Val Asp Leu Gly Asp Cys Ile Gly Lys Asp 
Ala Arg Asp Ala Met Asp Arg Ile Phe Ala Arg Arg Tyr Asn Ala Thr 
His Ile Lys Val Gly Gln Pro Gln Tyr Tyr Leu Ala Asn Gly Gly Phe 
Leu Ile Ala Tyr Gln Pro Leu Leu Ser Asn Thr Leu Ala Glu Leu Tyr 
Val Arg Glu His Leu Arg Glu Gln Ser Arg Lys Pro Pro Asn Pro Thr 
Pro Pro Pro Pro Gly Ala Ser Ala Asn Ala Ser Val Glu Arg Ile Lys 
Thr Thr Ser Ser Ile Glu Phe Ala Arg Leu Gln Phe Thr Tyr Asn His 
Ile Gln Arg His Val Asn Asp Met Leu Gly Arg Val Ala Ile Ala Trp 
Cys Glu Leu Gln Asn His Glu Leu Thr Leu Trp Asn Glu Ala Arg Lys 
Leu Asn Pro Asn Ala Ile Ala Ser Val Thr Val Gly Arg Arg Val Ser 
Ala Arg Met Leu Gly Asp Val Met Ala Val Ser Thr Cys Val Pro Val 
Ala Ala Asp Asn Val Ile Val Gln Asn Ser Met Arg Ile Ser Ser Arg 
Pro Gly Ala Cys Tyr Ser Arg Pro Leu Val Ser Phe Arg Tyr Glu Asp 
Gln Gly Pro Leu Val Glu Gly Gln Leu Gly Glu Asn Asn Glu Leu Arg 
Leu Thr Arg Asp Ala Ile Glu Pro Cys Thr Val Gly His Arg Arg Tyr 
Phe Thr Phe Gly Gly Gly Tyr Val Tyr Phe Glu Glu Tyr Ala Tyr Ser 
His Gln Leu Ser Arg Ala Asp Ile Thr Thr Val Ser Thr Phe Ile Asp 
Leu Asn Ile Thr Met Leu Glu Asp His Glu Phe Val Pro Leu Glu Val 
Tyr Thr Arg His Glu Ile Lys Asp Ser Gly Leu Leu Asp Tyr Thr Glu 
Val Gln Arg Arg Asn Gln Leu His Asp Leu Arg Phe Ala Asp Ile Asp 
Thr Val Ile His Ala Asp Ala Asn Ala Ala Met Phe Ala Gly Leu Gly 
Ala Phe Phe Glu Gly Met Gly Asp Leu Gly Arg Ala Val Gly Lys Val 
Val Met Gly Ile Val Gly Gly Val Val Ser Ala Val Ser Gly Val Ser 
Ser Phe Met Ser Asn Pro Phe Gly Ala Leu Ala Val Gly Leu Leu Val 
Leu Ala Gly Leu Ala Ala Ala Phe Phe Ala Phe Arg Tyr Val Met Arg 
Leu Gln Ser Asn Pro Met Lys Ala Leu Tyr Pro Leu Thr Thr Lys Glu 
Leu Lys Asn Pro Thr Asn Pro Asp Ala Ser Gly Glu Gly Glu Glu Gly 
Gly Asp Phe Asp Glu Ala Lys Leu Ala Glu Ala Arg Glu Met Ile Arg 
Tyr Met Ala Leu Val Ser Ala Met Glu His Thr Glu His Lys Ala Lys 
Lys Lys Gly Thr Ser Ala Leu Leu Ser Ala Lys Val Thr Asp Met Val 
Met Arg Lys Arg Arg Asn Thr Asn Tyr Thr Gln Val Pro Asn Lys Asp 
Ser Asp Ala Asp Glu Asp Asp Leu 

1 GAGTTGCGCCGCCCG 15 

GACTGCAGCCGCCCGACCTCCGAAGGTCGTTACCGTTACCCGCCCGGCGTAT 67 

ATCTCACGTACGACTCCGACTGTCCGCTGGTGGCCATCGTCGAGAGCGCCCC 119 

CGACGGCTGTATCGGCCCCCGGTCGGTCGTGGTCTACGACCGAGACGTTTTC 171 

TCGATCCTCTACTCGGTCCTCCAGCACCTCGCCCCCAGGCTACCTGACGGGG 223 

GGCACGACGGGCCCCCGTAGTCCCGCC ATG CGC CAG GGC GCC CCC GCG 271 

CGG GGG TGC CGG TGG TTC GTC GTA TGG GCG CTC TTG GGG TTG ACG 316 


CTG GGG GTC CTG GTG GCG TCG GCG GCT CCG AGT TCC CCC GGC ACG 361 

CCT GGG GTC GCG GCC GCG ACC CAG GCG GCG AAC GGG GGA CCT GCC 406 

ACT CCG GCG CCG CCC GCC CCT GGC CCC GCC CCA ACG GGG GAC ACG 451 

AAA CCG AAG AAG AAC AAA AAA CCG AAA AAC CCA CCG CCG CCG CGC 496 

CCC GCC GGC GAC AAC GCG ACC GTC GCC GCG GGC CAC GCC ACC CTG 541 

CGC GAG CAC CTG CGG GAC ATC AAG GCG AAG AAC ACC GAT GCA AAC 586 

TTT TAC GTG TGC CCA CCC CCC ACG GGC GCC ACG GTG GTG CAG TTC 631 

GAG CAG CCG CGC CGC TGC CCG ACC CGG CCC GAG GGT CAG AAC TAC 676 

ACG GAG GGC ATC GCG GTG GTC TTC AAG GAG AAC ATC GCC CCG TAC 721 

AAG TTC AAG GCC ACC ATG TAC TAC AAA GAC GTC ACC GTT TCG CAG 766 

GTG TGG TTC GGC CAC CGC TAC TCC CAG TTT ATG GGG ATC TTT GAG 811 

GAC CGC GCC CCC GTC CCC TTC GAG GAG GTG ATC GAC AAG ATC AAC 856 

GCC AAG GGG GTC TGT CGG TCC ACG GCC AAG TAC GTG CGC AAC AAC 901 

CTG GAG ACC ACC GCG TTT CAC CGG GAC GAC CAC GAG ACC GAC ATG 946 

GAG CTG AAA CCG GCC AAC GCC GCG ACC CGC ACG AGC CGG GGC TGG 991 

CAC ACC ACC GAC CTC AAG TAC AAC CCC TCG CGG GTG GAG GCG TTC 1036 

CAC CGG TAC GGG ACG ACG GTA AAC TGC ATC GTC GAG GAG GTG GAC 1081 

GCG CGC TCG GTG TAC CCG TAC AAC GAG TTT GTG CTG GCG ACT GGC 1126 

GAC TTT GTG TAC ATG TCC CCG TTT TAC GGC TAC CGG GAG GGG TCG 1171 

CAC ACC GAA CAC ACC AGC TAC GCC GCC GAC CGC TTC AAG CAG GTC 1216 

GAC GGC TTC TAC GCG CGC GAC CTC ACC ACC AAG GCC CGG GCC ACG 1261 
GCG CCG ACC ACC CGG AAC CTG CTC ACG ACC CCC AAG TTC ACC GTG 1306 
GCC TGG GAC TGG GTG CCA AAG CGC CCG TCG GTC TGC ACC ATG ACC 1351 
AAG TGG CAG GAG GTG GAC GAG ATG CTG CGC TCC GAG TAC GGC GGC 1396 
TCC TTC CGA TTC TCC TCC GAC GCC ATA TCC ACC ACC TTC ACC ACC 1441 
AAC CTG ACC GAG TAC CCG CTC TCG CGC GTG GAC CTG GGG GAC TGC 1486 
ATC GGC AAG GAC GCC CGC GAC GCC ATG GAC CGC ATC TTC GCC CGC 1531 
AGG TAC AAC GCG ACG CAC ATC AAG GTG GGC CAG CCG CAG TAC TAC 1576 
CTG GCC AAT GGG GGC TTT CTG ATC GCG TAC CAG CCC CTT CTC AGC 1621 
AAC ACG CTC GCG GAG CTG TAC GTG CGG GAA CAC CTC CGA GAG CAG 1666 
AGC CGC AAG CCC CCA AAC CCC ACG CCC CCG CCG CCC GGG GCC AGC 1711 
GCC AAC GCG TCC GTG GAG CGC ATC AAG ACC ACC TCC TCC ATC GAG 1756 
TTC GCC CGG CTG CAG TTT ACG TAC AAC CAC ATA CAG CGC CAT GTC 1801 
AAC GAT ATG TTG GGC CGC GTT GCC ATC GCG TGG TGC GAG CTG CAG 1846 
AAT CAC GAG CTG ACC CTG TGG AAC GAG GCC CGC AAG CTG AAC CCC 1891 
AAC GCC ATC GCC TCG GTC ACC GTG GGC CGG CGG GTG AGC GCG CGG 1936 
ATG CTC GGC GAC GTG ATG GCC GTC TCC ACG TGC GTG CCG GTC GCC 1981 
GCG GAC AAC GTG ATC GTC CAA AAC TCG ATG CGC ATC AGC TCG CGG 2026 
CCC GGG GCC TGC TAC AGC CGC CCC CTG GTC AGC TTT CGG TAC GAA 2071 
GAC CAG GGC CCG TTG GTC GAG GGG CAG CTG GGG GAG AAC AAC GAG 2116 
CTG CGG CTG ACG CGC GAT GCG ATC GAG CCG TGC ACC GTG GGA CAC 2161 
CGG CGC TAC TTC ACC TTC GGT GGG GGC TAC GTG TAC TTC GAG GAG 2206 
TAC GCG TAC TCC CAC CAG CTG AGC CGC GCC GAC ATC ACC ACC GTC 2251 
AGC ACC TTC ATC GAC CTC AAC ATC ACC ATG CTG GAG GAT CAC GAG 2296 
TTT GTC CCC CTG GAG GTG TAC ACC CGC CAC GAG ATC AAG GAC AGC 2341 
GGC CTG CTG GAC TAC ACG GAG GTC CAG CGC CGC AAC CAG CTG CAC 2386 
GAC CTG CGC TTC GCC GAC ATC GAC ACG GTC ATC CAC GCC GAC GCC 2431 
AAC GCC GCC ATG TTC GCG GGC CTG GGC GCG TTC TTC GAG GGG ATG 2476 
GGC GAC CTG GGG CGT GCG GTC GGC AAG GTG GTG ATG GGC ATC GTG 2521 
GGC GGC GTG GTA TCG GCC GTG TCG GGC GTG TCC TCC TTC ATG TCC 2566 
AAC CCC TTT GGG GCG CTG GCC GTG GGT CTG TTG GTC CTG GCC GGC 2611 
CTG GCG GCG GCC TTC TTC GCC TTT CGC TAC GTC ATG CGG CTG CAG 2656 
AGC AAC CCC ATG AAG GCC CTG TAC CCG CTA ACC ACC AAG GAG CTC 2701 
AAG AAC CCC ACC AAC CCG GAC GCG TCC GGG GAG GGC GAG GAG GGC 2746 
GGC GAC TTT GAC GAG GCC AAG CTA GCC GAG GCC CGG GAG ATG ATA 2791 
CGG TAC ATG GCC CTG GTG TCG GCC ATG GAG CAC ACG GAA CAC AAG 2836 
GCC AAG AAG AAG GGC ACG AGC GCG CTG CTT AGC GCC AAG GTC ACC 2881 
GAC ATG GTC ATG CGC AAG CGC CGC AAC ACC AAC TAC ACC CAA GTT 2926 

Stop 

CCC AAC AAA GAC AGT GAC GCC GAC GAG GAC GAC CTG TGA CGGGGG 2971 
GTTTGTTGTAAATAAAAACCACGGGTGTTAAACCGCATGCGCATCTTTTGGT 3023 
TTTTTTTTTTGTACGCCCTTTGTGTGTGTGGGAAGAAAGAAAAAGGAACACA 3075 
TAAACTCCCCCGGGTGTCCGCGGCCTGTTTCCTCTTTCCTTTCCCGTGACAA 3127 
AACTGACCCCCTTGGTCAGTGCCGATTCCCCCCCCCCCCCCCACGCCTTCCT 3179 
CCACGTCGAAGGCTTTTGTATTGTAAAGCTACCCGCCTACCCGCGCCTCCCA 3231 
ATAAAAAAAAAAAGAACATACACCAATGGGTCTTATTTGGTATTACCTGGTT 3283 
TATTTAAAAAGATATACAGTAAGACATCCCATGGTACCAAAGACCGGGGCGA 3335 
ATCAGCGGGCCCCCATCATCTGAGAGACGAACAAATCGGCGGCGCGGGCCGT 3387 
GTCAACGTCCACGTGTGCTGCGCTGCTGGCGTTGACAAGGGCCCCGGCCTCC 3439 
GCGTTGGATGCCTCCGGTTGGGATCC 3465 
CTCGGAGAAGATGCTGCGGTCAGCGTCCACGGCGAGGTGCTGCCCGCGACGTTCGCGGCG 60 
GTCGCCAACGGCTTTGCGGCCGCGCGCCGGCTTCTGGCCGCCCCTGACGGCGGGCGCGGG 120 
CACGGTCATCGACAACCGGTCGGCGCCGGGCGTGTTCGACGCGGACCGGTTGATGCGAGC 180 
GTCTCTCCTGCGACACCAGGTGGACCCGGCCCTGCTCCCCACCATCACCCATCGCTTCTT 240 
CGAGCTGGTCAACGGGCCCCTCTTTGATCACTCCACCCACAGCTTCGCCCAGCCCCCCAA 300 
CACCGCGCTGTATTACAGCGTCGAGAACGTGGGGCTCGTGGCGCACCTGAAGGAGGAGCT 360 
CGCCCGGTTCATCATGGGGGCGGGGGGGTCGGGTGCTGATTGGGCCGTCAGCGAATTTCA 420 
GAGGTTTTACTGTTTTGACGGCATTTCCGGAATAACGCCCACTCAGCGGGCCGCCTGGCG 480 
ATATATTCGCGAGCTGATTATCGCCACCACACTCTTTGCCTCGGTCTACCGGTGCGGGGA 540 
GCTCGAGTTGCGGCGCGCGGACTGCAGCCGCCCGACCTCCGAAGGTCGTTACCGTTACCC 600 
GCCCGGCGTATATCTCACGTACGACTCCGACTGTCCGCTGGTGGCCATCGTCGAGAGCGC 660 
CCCCGACGGCTGTATCGGCCCCCGGTCGGTCGTGGTCTACGACCGAGACGTTTTCTCGAT 720 
CCTCTACTCGGTCCTCCAGCACCTCGCCCCCAGGCTACCTGACGGGGGGCACGACGGGC 779 

MetArgGlnGlyAlaAlaArgGlyCysArgTrpPheValValTrp -15 
AsnLeuGluThrThrAlaPheHisArgAspAspHisGluThrAspMetGluLeuLysPro 206 
AACCTGGAGACCACC... 1499 

...ThrArgThrSerArgGlyTrpHisThrThrAspLeuLysTyrAsnPro 226 

SerArgValGluAlaPheHisArgTyrGlyThrThrValAsnCysIleValGluGluVal 246 
... 1619 

AspAlaArgSerValTyrProTyrAspGluPheValLeuAlaThrGlyAspPheValTyr 266 
GACGCGCGCTCGGTGTACCCGTAGGACGAGTTTGTGC... 1679 

MetSerProPheTyrGlyTyrArgGluGlySerHisThrGluHisThrSerTyrAlaAla 286 
ATGTCCCCGTTTTACGGCTACCGGGAGGGGTCGCAGACCGAACACACCAGCTACGCCGCC 1739 

AspArgPheLysGlnValAspGlyPheTyrAlaArgAspLeuThrTyrLysAlaArgAla 306 
GACCGCTTCAAGCAGGTTGACGGCTTCTACGCGCGCGACCTCACCACCAAGGCCCGGGCC 1799 

ThrAlaProThrThrArgAsnLeuLeuThrThrProLysPheThrValAlaTrpAspTrp 326 
ACGGCGGCGACGAGCCGGAACGTGCTCACGACCCCGAAGTTCACCGTGGGCTGGGACTGG 1859 

ValProLysArgProSerValCysThrMetThrLysTrpGlnGluValAspGluMetLeu 346 
GTGCCAAAGCGCCCGTCGGTCTGCAGCATGACCAAGTGGCAGGAGGTGGACGAGATGGTG 1919 

ArgSerGluTyrGlyGlySerPheArgPheSerSerAspAlaIleSerThrThrPheThr 366 
CGCTCCGAGTACGGCGGCTCCTTCCGATTCTCCTCCGACGCCATATCCACCACCTTCACC 1979 

... 2039 

AlaArgAspAlaMetAspArgIlePheAlaArgArgTyrAsnAlaThrHisIle... 406 
GCCCGCGACGCCATGGACCGCATCTTCGCCCGCAGGTACAACGCGACGCACATCAAGGTG 2099 

GlyGlnProGlnTyrTyrLeuAlaAsnGlyGlyPheLeuIleAlaTyrGlnProLeuLeu 426 
GGCCAGCCGCAGTACTACCTGGCCAATGGGGGCTTTCTGATCGCGTACCAGCCCCTTCTC 2159 

SerAsnThrLeuAlaGluLeuTyrValArgGluHisLeuArg... 446 
AGCAACACGCTCGCGGAGCTGTACGTGCGGGAACACCTCCGAGAGCAGAGCCGCAAGCCC 2219 

ProAsnProThrProProProProGlyAlaSerAlaAsnAlaSerValGluArg... 466 
CCAAACCCCACGCGCGCGCCGCCCGGGGCCAGCGCCAACGCGTCCGTGGAGCGCATCAAG 2279 

ThrThrSerSerIleGluPheAlaArgLeuGlnPheThrTyrAsnHisIleGlnArgHis 486 
...TTCGCCCGGCTGCAGTTTACGTACAACCACATAC... 2339 

...LeuGlyArgValAlaIleAlaTrpCysGluLeuGlnAsnHisGluLeu 506 

ThrLeuTrpAsnGluAlaArgLysLeuAsnProAsnAlaIleAlaSerAlaThrValGly 526 
ACCCTGTGGAACGAGGCCCGCAA... 2459 

ArgArgValSerAlaArgMetLeuGlyAspValMetAlaValSerThrCysValProVal 546 
CGGCGGGTGAGCGCGCGGATGCTCGGCGACGT... 2519 

AlaAlaAspAsnValIleValGlnAsnSerMetArgIleSerSerArgProGlyAlaCys 566 
GCCGCGGACAACGTGATCGTCCAAAACTCG... 2579 

...SerPheArgTyrGluAspGlnGlyProLeuValGluGlyGln 586 
... 2639 
...ArgLeuGlnSerAsnProMetLysAlaLeuTyr... 786 

TTAAACCGCATGTGCATCTTTTGGTTTGTTTGTTTGGTACGCCTTTTGTGTGTGTGGGAA 3599 
GAAAGAAAAGGGAACACATAAACTCCCCCGGGTGTCCGCGGCCTGTTTCCTCTTTCCTTT 3659 
CCCGTGACAAAACGGACCCCCTTGGTCAGTGCCGATTCCCCCCCCACGCCTTCCTCCACG 3719 
TCGAAGGCTTTTGCATTGTAAAGCTACCCGCCTACCCGCGCCTCCCAATAAAAAAAGAAC 3779 
ATACACCAATGGGTCTTATTTGGTATTACCTGGTTTATTTAAAAAGATATACAGTAAGAC 3839 
ATCCCATGGTACCAAAGACCGGGGCGAATCAGCGGGCCCCCATCATCTGAGAGACGAACA 3899 
AATCGGCGGCGCGGGCCGTGTCAACGTCCACGTGTGCTGCGCTGCTGGCGTTGACAAGGG 3959 
CCCCGGCCTCCGCGTTGGATGCCTCCGGTTGGGATCC 3996 
[Figure: plasmid construction scheme. Legible labels: pHSG397; pHSG396SgD; XhoI-XbaI 1.35 kb 
fragment; HinfI 0.91 kb fragment; gD terminator adaptor (12 bp DNA fragment); pHSG397SgDΔHinf 
(truncated gD).] 

gD terminator adaptor: 
HinfI  5' ATTCGGCTTAGTAAGAGCT 3'  SacI 
       3'    GCCGAATCATTC     5' 
[Figure: plasmid construction scheme. Legible labels: truncated; HinfI; EcoRI; 0.95 kb; Klenow; 
NheI linker; SacI; T4 polymerase; HindIII; SalI linker; ClaI; BglII; SalI; SV40; 0.32 kb; LThB.] 

1 ATG GGG GGG GCT GCC GCC AGG TTG GGG GCC GTG ATT TTG TTT GTC 

46 GTC ATA GTG GGC CTC CAT GGG GTC CGC GGC AAA TAT GCC TTG GCG 

91 GAT GCC TCT CTC AAG ATG GCC GAC CCC AAT CGC TTT CGC GGC AAA 

136 GAC CTT CCG GTC CTG GAC CCG CTG ACC GAC CCT CCG GGG GTC CGG 

181 CGC GTG TAC CAC ATC CAG GCG GGC CTA CCG GAC CCG TTC CAG CCC 

226 CCC AGC CTC CCG ATC ACG GTT TAC TAC GCC GTG TTG GAG CGC GCC 

271 TGC CGC AGC GTG CTC CTA AAC GCA CCG TCG GAG GCC CCC CAG ATT 

316 GTC CGC GGG GCC TCC GAA GAC GTC CGG AAA CAA CCC TAC AAC CTG 

361 ACC ATC GCT TGG TTT CGG ATG GGA GGC AAC TGT GCT ATC CCC ATC 

406 ACG GTC ATG GAG TAC ACC GAA TGC TCC TAC AAC AAG TCT CTG GGG 

451 GCC TGT CCC ATC CGA ACG CAG CCC CGC TGG AAC TAC TAT GAC AGC 

496 TTC AGC GCC GTC AGC GAG GAT AAC CTG GGG TTC CTG ATG CAC GCC 

541 CCC GCG TTT GAG ACC GCC GGC ACG TAC CTG CGG CTC GTG AAG ATA 

586 AAC GAC TGG ACG GAG ATT ACA CAG TTT ATC CTG GAG CAC CGA GCC 

631 AAG GGC TCC TGT AAG TAC GCC CTC CCG CTG CGC ATC CCC CCG TCA 

676 GCC TGC CTC TCC CCC CAG GCC TAC CAG CAG GGG GTG ACG GTG GAC 

721 AGC ATC GGG ATG CTG CCC CGC TTC ATC CCC GAG AAC CAG CGC ACC 

766 GTC GCC GTA TAC AGC TTG AAG ATC GCC GGG TGG CAC GGG CCC AAG 

811 GCC CCA TAC ACG AGC ACC CTG CTG CCC CCT GAG CTG TCC GAG ACC 

856 CCC AAC GCC ACG CAG CCA GAA CTC GCC CCG GAA GAC CCC GAG GAT 

901 TCG CTA GCG CCC CAG TCT ATT ACA GAA CTA TGT TCG GAA TAT CGC 

946 AAC ACA CAA ATA TAT ACG ATA AAT GAC AAG ATA CTA TCA TAT ACG 

991 GAA TCG ATG GCA GGC AAA AGA GAA ATG GTT ATC ATT ACA TTT AAG 

1036 AGC GGC GCA ACA TTT CAG GTC GAA GTC CCG GGC AGT CAA CAT ATA 

1081 GAC TCC CAA AAA AAA GCC ATT GAA AGG ATG AAG GAC ACA TTA AGA 

1126 ATC ACA TAT CTG ACC GAG ACC AAA ATT GAT AAA TTA TGT GTA TGG 

1171 AAT AAT AAA ACC CCC AAT TCA ATT GCG GCA ATC AGT ATG GAA AAC 

1216 TAG 
1 Met Gly Gly Ala Ala Ala Arg Leu Gly Ala Val Ile Leu Phe Val 
16 Val Ile Val Gly Leu His Gly Val Arg Gly Lys Tyr Ala Leu Ala 
31 Asp Ala Ser Leu Lys Met Ala Asp Pro Asn Arg Phe Arg Gly Lys 
46 Asp Leu Pro Val Leu Asp Pro Leu Thr Asp Pro Pro Gly Val Arg 
61 Arg Val Tyr His Ile Gln Ala Gly Leu Pro Asp Pro Phe Gln Pro 
76 Pro Ser Leu Pro Ile Thr Val Tyr Tyr Ala Val Leu Glu Arg Ala 
91 Cys Arg Ser Val Leu Leu Asn Ala Pro Ser Glu Ala Pro Gln Ile 
106 Val Arg Gly Ala Ser Glu Asp Val Arg Lys Gln Pro Tyr Asn Leu 
121 Thr Ile Ala Trp Phe Arg Met Gly Gly Asn Cys Ala Ile Pro Ile 
136 Thr Val Met Glu Tyr Thr Glu Cys Ser Tyr Asn Lys Ser Leu Gly 
151 Ala Cys Pro Ile Arg Thr Gln Pro Arg Trp Asn Tyr Tyr Asp Ser 
166 Phe Ser Ala Val Ser Glu Asp Asn Leu Gly Phe Leu Met His Ala 
181 Pro Ala Phe Glu Thr Ala Gly Thr Tyr Leu Arg Leu Val Lys Ile 
196 Asn Asp Trp Thr Glu Ile Thr Gln Phe Ile Leu Glu His Arg Ala 
211 Lys Gly Ser Cys Lys Tyr Ala Leu Pro Leu Arg Ile Pro Pro Ser 
226 Ala Cys Leu Ser Pro Gln Ala Tyr Gln Gln Gly Val Thr Val Asp 
241 Ser Ile Gly Met Leu Pro Arg Phe Ile Pro Glu Asn Gln Arg Thr 
256 Val Ala Val Tyr Ser Leu Lys Ile Ala Gly Trp His Gly Pro Lys 
271 Ala Pro Tyr Thr Ser Thr Leu Leu Pro Pro Glu Leu Ser Glu Thr 
286 Pro Asn Ala Thr Gln Pro Glu Leu Ala Pro Glu Asp Pro Glu Asp 
301 Ser Leu Ala Pro Gln Ser Ile Thr Glu Leu Cys Ser Glu Tyr Arg 
316 Asn Thr Gln Ile Tyr Thr Ile Asn Asp Lys Ile Leu Ser Tyr Thr 
331 Glu Ser Met Ala Gly Lys Arg Glu Met Val Ile Ile Thr Phe Lys 
346 Ser Gly Ala Thr Phe Gln Val Glu Val Pro Gly Ser Gln His Ile 
361 Asp Ser Gln Lys Lys Ala Ile Glu Arg Met Lys Asp Thr Leu Arg 
376 Ile Thr Tyr Leu Thr Glu Thr Lys Ile Asp Lys Leu Cys Val Trp 
391 Asn Asn Lys Thr Pro Asn Ser Ile Ala Ala Ile Ser Met Glu Asn * 
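
The amino acid sequence above corresponds codon by codon to the nucleotide sequence of the fused gene that precedes it. As a minimal sketch of that correspondence, the following translates a coding sequence with the standard genetic code. The function and variable names are hypothetical, and the input is the first numbered line of the fused gene with its OCR-damaged thirteenth codon read as TTG, which is an assumption.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, one letter per codon in TCAG order (TTT, TTC, TTA, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "*",
}


def translate(dna):
    """Translate a coding sequence into three-letter codes, stopping at a stop codon.

    Whitespace is ignored; a codon with a character outside A, C, G, T
    raises KeyError.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        residues.append(THREE_LETTER[aa])
        if aa == "*":
            break
    return " ".join(residues)


# First numbered line of the fused gene (thirteenth codon assumed to be TTG).
first_line = "ATG GGG GGG GCT GCC GCC AGG TTG GGG GCC GTG ATT TTG TTT GTC"
print(translate(first_line))
```

Running the same function over the remaining lines of the nucleotide sequence gives a way to cross-check individual residues in the listing.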